Frederick Jelinek

CLSP, The Johns Hopkins University

Structured Language Modeling for Speech Recognition

Jan 25, 2000

Ciprian Chelba, Frederick Jelinek

Abstract: A new language model for speech recognition is presented. The model develops hidden hierarchical syntactic-like structure incrementally and uses it to extract meaningful information from the word history, thus complementing the locality of currently used trigram models. The structured language model (SLM) and its performance in a two-pass speech recognizer (lattice decoding) are presented. Experiments on the WSJ corpus show an improvement in both perplexity (PPL) and word error rate (WER) over conventional trigram models.

* Proceedings of NLDB'99, Klagenfurt, Austria
* 4 pages + 2 pages of ERRATA

Exploiting Syntactic Structure for Language Modeling

Jan 25, 2000

Ciprian Chelba, Frederick Jelinek

Abstract: The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint sequence of words and binary parse structure with headword annotation and operates in a left-to-right manner, which makes it usable for automatic speech recognition. The model, its probabilistic parameterization, and a set of experiments meant to evaluate its predictive power are presented; an improvement over standard trigram modeling is achieved.

* Proceedings of ACL'98, Montreal, Canada
* changed ACM-class membership and buggy author names

Recognition Performance of a Structured Language Model

Jan 24, 2000

Ciprian Chelba, Frederick Jelinek

Abstract: A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history, thus enabling the use of extended distance dependencies, in an attempt to complement the locality of currently used trigram models. The structured language model, its probabilistic parameterization and performance in a two-pass speech recognizer are presented. Experiments on the SWITCHBOARD corpus show an improvement in both perplexity and word error rate over conventional trigram models.

* Proceedings of Eurospeech, 1999, pp. 1567-1570, Budapest, Hungary
* 4 pages

Refinement of a Structured Language Model

Jan 24, 2000

Ciprian Chelba, Frederick Jelinek

Abstract: A new language model for speech recognition inspired by linguistic analysis is presented. The model develops hidden hierarchical structure incrementally and uses it to extract meaningful information from the word history, thus enabling the use of extended distance dependencies, in an attempt to complement the locality of currently used n-gram Markov models. The model, its probabilistic parametrization, a reestimation algorithm for the model parameters and a set of experiments meant to evaluate its potential for speech recognition are presented.

* Proceedings of the International Conference on Advances in Pattern Recognition, 1998, pp. 275-284, Plymouth, UK
* 10 pages
